SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: Garoff, Henrik 

Liljestrom, Peter 

(ii) TITLE OF INVENTION: DNA Expression Systems Based on 
Alphaviruses 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Birch, Stewart, Kolasch & Birch 

(B) STREET: P.O. Box 747 

(C) CITY: Falls Church 

(D) STATE: Virginia 
(E) COUNTRY: USA

(F) ZIP: 22040-0747

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER: US 07/920,281
(B) FILING DATE: 13-AUG-1992

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Murphy Jr., Gerald M.

(B) REGISTRATION NUMBER: 28,977 

(C) REFERENCE/DOCKET NUMBER: 828-103P 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-241-1300 

(B) TELEFAX: 703-241-2848 

(C) TELEX: 248345 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11517 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 



(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Semliki Forest Virus 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..11517

(D) OTHER INFORMATION: /label= genome 

/note= "Semliki Forest Virus complete nucleotide
sequence, presented as a cloned DNA sequence; see
Figure 5."

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 87..7379

(D) OTHER INFORMATION: /product= "SFV polyprotein" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 7421..11179

(D) OTHER INFORMATION: /product= "SFV polyprotein"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GATGGCGGAT GTGTGACATA CACGACGCCA AAAGATTTTG TTCCAGCTCC TGCCACCTCC 60

GCTACGCGAG AGATTAACCA CCCACG ATG GCC GCC AAA GTG CAT GTT GAT ATT 113
Met Ala Ala Lys Val His Val Asp Ile
1 5

GAG GCT GAC AGC CCA TTC ATC AAG TCT TTG CAG AAG GCA TTT CCG TCG 161
Glu Ala Asp Ser Pro Phe Ile Lys Ser Leu Gln Lys Ala Phe Pro Ser
10 15 20 25

TTC GAG GTG GAG TCA TTG CAG GTC ACA CCA AAT GAC CAT GCA AAT GCC 209
Phe Glu Val Glu Ser Leu Gln Val Thr Pro Asn Asp His Ala Asn Ala
30 35 40

AGA GCA TTT TCG CAC CTG GCT ACC AAA TTG ATC GAG CAG GAG ACT GAC 257
Arg Ala Phe Ser His Leu Ala Thr Lys Leu Ile Glu Gln Glu Thr Asp
45 50 55



AAA GAC ACA CTC ATC TTG GAT ATC GGC AGT GCG CCT TCC AGG AGA ATG 305
Lys Asp Thr Leu Ile Leu Asp Ile Gly Ser Ala Pro Ser Arg Arg Met
60 65 70

ATG TCT ACG CAC AAA TAC CAC TGC GTA TGC CCT ATG CGC AGC GCA GAA 353
Met Ser Thr His Lys Tyr His Cys Val Cys Pro Met Arg Ser Ala Glu
75 80 85

GAC CCC GAA AGG CTC GAT AGC TAC GCA AAG AAA CTG GCA GCG GCC TCC 401
Asp Pro Glu Arg Leu Asp Ser Tyr Ala Lys Lys Leu Ala Ala Ala Ser
90 95 100 105

GGG AAG GTG CTG GAT AGA GAG ATC GCA GGA AAA ATC ACC GAC CTG CAG 449
Gly Lys Val Leu Asp Arg Glu Ile Ala Gly Lys Ile Thr Asp Leu Gln
110 115 120

ACC GTC ATG GCT ACG CCA GAC GCT GAA TCT CCT ACC TTT TGC CTG CAT 497 
Thr Val Met Ala Thr Pro Asp Ala Glu Ser Pro Thr Phe Cys Leu His 
125 130 135 

ACA GAC GTC ACG TGT CGT ACG GCA GCC GAA GTG GCC GTA TAC CAG GAC 545
Thr Asp Val Thr Cys Arg Thr Ala Ala Glu Val Ala Val Tyr Gln Asp
140 145 150

GTG TAT GCT GTA CAT GCA CCA ACA TCG CTG TAC CAT CAG GCG ATG AAA 593
Val Tyr Ala Val His Ala Pro Thr Ser Leu Tyr His Gln Ala Met Lys
155 160 165

GGT GTC AGA ACG GCG TAT TGG ATT GGG TTT GAC ACC ACC CCG TTT ATG 641
Gly Val Arg Thr Ala Tyr Trp Ile Gly Phe Asp Thr Thr Pro Phe Met
170 175 180 185



TTT GAC GCG CTA GCA GGC GC[?] TAT CCA ACC TAC GCC ACA AAC TGG GCC 689
Phe Asp Ala Leu Ala Gly Ala Tyr Pro Thr Tyr Ala Thr Asn Trp Ala
190 195 200

GAC GAG CAG GTG TTA CAG GCC AGG AAC ATA GGA CTG TGT GCA GCA TCC 737
Asp Glu Gln Val Leu Gln Ala Arg Asn Ile Gly Leu Cys Ala Ala Ser
205 210 215

TTG ACT GAG GGA AGA CTC GGC AAA CTG TCC ATT CTC CGC AAG AAG CAA 785
Leu Thr Glu Gly Arg Leu Gly Lys Leu Ser Ile Leu Arg Lys Lys Gln
220 225 230

TTG AAA CCT TGC GAC ACA GTC ATG TTC TCG GTA GGA TCT ACA TTG TAC 833 
Leu Lys Pro Cys Asp Thr Val Met Phe Ser Val Gly Ser Thr Leu Tyr 
235 240 245 

ACT GAG AGC AGA AAG CTA CTG AGG AGC TGG CAC TTA CCC TCC GTA TTC 881 
Thr Glu Ser Arg Lys Leu Leu Arg Ser Trp His Leu Pro Ser Val Phe 
250 255 260 265 

CAC CTG AAA GGT AAA CAA TCC TTT ACC TGT AGG TGC GAT ACC ATC GTA 929
His Leu Lys Gly Lys Gln Ser Phe Thr Cys Arg Cys Asp Thr Ile Val
270 275 280

TCA TGT GAA GGG TAC GTA GTT AAG AAA ATC ACT ATG TGC CCC GGC CTG 977
Ser Cys Glu Gly Tyr Val Val Lys Lys Ile Thr Met Cys Pro Gly Leu
285 290 295

TAC GGT AAA ACG GTA GGG TAC GCC GTG ACG TAT CAC GCG GAG GGA TTC 1025
Tyr Gly Lys Thr Val Gly Tyr Ala Val Thr Tyr His Ala Glu Gly Phe
300 305 310

CTA GTG TGC AAG ACC ACA GAC ACT GTC AAA GGA GAA AGA GTC TCA TTC 1073
Leu Val Cys Lys Thr Thr Asp Thr Val Lys Gly Glu Arg Val Ser Phe
315 320 325

CCT GTA TGC ACC TAC GTC CCC TCA ACC ATC TGT GAT CAA ATG ACT GGC 1121
Pro Val Cys Thr Tyr Val Pro Ser Thr Ile Cys Asp Gln Met Thr Gly
330 335 340 345

ATA CTA GCG ACC GAC GTC ACA CCG GAG GAC GCA CAG AAG TTG TTA GTG 1169
Ile Leu Ala Thr Asp Val Thr Pro Glu Asp Ala Gln Lys Leu Leu Val
350 355 360

[illegible] TTG AAT CAG AGG ATA GTT GTG AAC GGA AGA ACA CAG CGA AAC ACT 1217
[illegible] Leu Asn Gln Arg Ile Val Val Asn Gly Arg Thr Gln Arg Asn Thr
365 370 375

[illegible] ACG ATG AAG AAC TAT CTG CTT CCG ATT GTG GCC GTC GCA TTT AGC 1265
Asn Thr Met Lys Asn Tyr Leu Leu Pro Ile Val Ala Val Ala Phe Ser
380 385 390

[illegible] TGG GCG AGG GAA TAC AAG GCA GAC CTT GAT GAT GAA AAA CCT CTG 1313
[illegible] Trp Ala Arg Glu Tyr Lys Ala Asp Leu Asp Asp Glu Lys Pro Leu
395 400 405

[illegible] GTC CGA GAG AGG TCA CTT ACT TGC TGC TGC TTG TGG GCA TTT AAA 1361
Gly Val Arg Glu Arg Ser Leu Thr Cys Cys Cys Leu Trp Ala Phe Lys
410 415 420 425

ACG AGG AAG ATG CAC ACC ATG TAC AAG AAA CCA GAC ACC CAG ACA ATA 1409
Thr Arg Lys Met His Thr Met Tyr Lys Lys Pro Asp Thr Gln Thr Ile
430 435 440

GTG AAG GTG CCT TCA GAG TTT AAC TCG TTC GTC ATC CCG AGC CTA TGG 1457
Val Lys Val Pro Ser Glu Phe Asn Ser Phe Val Ile Pro Ser Leu Trp
445 450 455

TCT ACA GGC CTC GCA ATC CCA GTC AGA TCA CGC ATT AAG ATG CTT TTG 1505
Ser Thr Gly Leu Ala Ile Pro Val Arg Ser Arg Ile Lys Met Leu Leu
460 465 470

GCC AAG AAG ACC AAG CGA GAG TTA ATA CCT GTT CTC GAC GCG TCG TCA 1553
Ala Lys Lys Thr Lys Arg Glu Leu Ile Pro Val Leu Asp Ala Ser Ser
475 480 485



GCC AGG GAT GCT GAA CAA GAG GAG AAG GAG AGG TTG GAG GCC GAG CTG 1601
Ala Arg Asp Ala Glu Gln Glu Glu Lys Glu Arg Leu Glu Ala Glu Leu
490 495 500 505

ACT AGA GAA GCC TTA CCA CCC CTC GTC CCC ATC GCG CCG GCG GAG ACG 1649
Thr Arg Glu Ala Leu Pro Pro Leu Val Pro Ile Ala Pro Ala Glu Thr
510 515 520

GGA GTC GTC GAC GTC GAC GTT GAA GAA CTA GAG TAT CAC GCA GGT GCA 1697 
Gly Val Val Asp Val Asp Val Glu Glu Leu Glu Tyr His Ala Gly Ala 
525 530 535 

GGG GTC GTG GAA ACA CCT CGC AGC GCG TTG AAA GTC ACC GCA CAG CCG 1745
Gly Val Val Glu Thr Pro Arg Ser Ala Leu Lys Val Thr Ala Gln Pro
540 545 550

AAC GAC GTA CTA CTA GGA AAT TAC GTA GTT CTG TCC CCG CAG ACC GTG 1793
Asn Asp Val Leu Leu Gly Asn Tyr Val Val Leu Ser Pro Gln Thr Val
555 560 565

CTC AAG AGC TCC AAG TTG GCC CCC GTG CAC CCT CTA GCA GAG CAG GTG 1841
Leu Lys Ser Ser Lys Leu Ala Pro Val His Pro Leu Ala Glu Gln Val
570 575 580 585

AAA ATA ATA ACA CAT AAC GGG AGG GCC GGC GGT TAC CAG GTC GAC GGA 1889
Lys Ile Ile Thr His Asn Gly Arg Ala Gly Gly Tyr Gln Val Asp Gly
590 595 600

TAT GAC GGC AGG GTC CTA CTA CCA TGT GGA TCG GCC ATT CCG GTC CCT 1937
Tyr Asp Gly Arg Val Leu Leu Pro Cys Gly Ser Ala Ile Pro Val Pro
605 610 615

GAG TTT CAA GCT TTG AGC GAG AGC GCC ACT ATG GTG TAC AAC GAA AGG 1985
Glu Phe Gln Ala Leu Ser Glu Ser Ala Thr Met Val Tyr Asn Glu Arg
620 625 630

GAG TTC GTC AAC AGG AAA CTA TAC CAT ATT GCC GTT CAC GGA CCG TCG 2033
Glu Phe Val Asn Arg Lys Leu Tyr His Ile Ala Val His Gly Pro Ser
635 640 645

CTG AAC ACC GAC GAG GAG AAC TAC GAG AAA GTC AGA GCT GAA AGA ACT 2081 
Leu Asn Thr Asp Glu Glu Asn Tyr Glu Lys Val Arg Ala Glu Arg Thr 
650 655 660 665 

GAC GCC GAG TAC GTG TTC GAC GTA GAT AAA AAA TGC TGC GTC AAG AGA 2129 
Asp Ala Glu Tyr Val Phe Asp Val Asp Lys Lys Cys Cys Val Lys Arg 

670 675 680 

GAG GAA GCG TCG GGT TTG GTG TTG GTG GGA GAG CTA ACC AAC CCC CCG 2177
Glu Glu Ala Ser Gly Leu Val Leu Val Gly Glu Leu Thr Asn Pro Pro
685 690 695

TTC CAT GAA TTC GCC TAC GAA GGG CTG AAG ATC AGG CCG TCG GCA CCA 2225
Phe His Glu Phe Ala Tyr Glu Gly Leu Lys Ile Arg Pro Ser Ala Pro
700 705 710

TAT AAG ACT ACA GTA GTA GGA GTC TTT GGG GTT CCG GGA TCA GGC AAG 2273 
Tyr Lys Thr Thr Val Val Gly Val Phe Gly Val Pro Gly Ser Gly Lys 
715 720 725 

TCT GCT ATT ATT AAG AGC CTC GTG ACC AAA CAC GAT CTG GTC ACC AGC 2321
Ser Ala Ile Ile Lys Ser Leu Val Thr Lys His Asp Leu Val Thr Ser
730 735 740 745

GGC AAG AAG GAG AAC TGC CAG GAA ATA GTT AAC GAC GTG AAG AAG CAC 2369
Gly Lys Lys Glu Asn Cys Gln Glu Ile Val Asn Asp Val Lys Lys His
750 755 760

CGC GGG AAG GGG ACA AGT AGG GAA AAC AGT GAC TCC ATC CTG CTA AAC 2417
Arg Gly Lys Gly Thr Ser Arg Glu Asn Ser Asp Ser Ile Leu Leu Asn
765 770 775

GGG TGT CGT CGT GCC GTG GAC ATC CTA TAT GTG GAC GAG GCT TTC GCT 2465
Gly Cys Arg Arg Ala Val Asp Ile Leu Tyr Val Asp Glu Ala Phe Ala
780 785 790

TGC CAT TCC GGT ACT CTG CTG GCC CTA ATT GCT CTT GTT AAA CCT CGG 2513
Cys His Ser Gly Thr Leu Leu Ala Leu Ile Ala Leu Val Lys Pro Arg
795 800 805

[illegible] AAA GTG GTG TTA TGC GGA GAC CCC AAG CAA TGC GGA TTC TTC AAT 2561
Ser Lys Val Val Leu Cys Gly Asp Pro Lys Gln Cys Gly Phe Phe Asn
810 815 820 825

ATG ATG CAG CTT AAG GTG AAC TTC AAC CAC AAC ATC TGC ACT GAA GTA 2609
Met Met Gln Leu Lys Val Asn Phe Asn His Asn Ile Cys Thr Glu Val
830 835 840

TGT CAT AAA AGT ATA TCC AGA CGT TGC ACG CGT CCA GTC ACG GCC ATC 2657
Cys His Lys Ser Ile Ser Arg Arg Cys Thr Arg Pro Val Thr Ala Ile
845 850 855

GTG TCT ACG TTG CAC TAC GGA GGC AAG ATG CGC ACG ACC AAC CCG TGC 2705 
Val Ser Thr Leu His Tyr Gly Gly Lys Met Arg Thr Thr Asn Pro Cys 
860 865 870 

AAC AAA CCC ATA ATC ATA GAC ACC ACA GGA CAG ACC AAG CCC AAG CCA 2753
Asn Lys Pro Ile Ile Ile Asp Thr Thr Gly Gln Thr Lys Pro Lys Pro
875 880 885

GGA GAC ATC GTG TTA ACA TGC TTC CGA GGC TGG GCA AAG CAG CTG CAG 2801
Gly Asp Ile Val Leu Thr Cys Phe Arg Gly Trp Ala Lys Gln Leu Gln
890 895 900 905

TTG GAG TAC CGT GGA CAC GAA GTC ATG ACA GCA GCA GCA TCT CAG GGC 2849
Leu Asp Tyr Arg Gly His Glu Val Met Thr Ala Ala Ala Ser Gln Gly
910 915 920

CTC ACC CGC AAA GGG GTA TAC GCC GTA AGG CAG AAG GTG AAT GAA AAT 2897
Leu Thr Arg Lys Gly Val Tyr Ala Val Arg Gln Lys Val Asn Glu Asn
925 930 935

CCC TTG TAT GCC CCT GCG TCG GAG CAC GTG AAT GTA CTG CTG ACG CGC 2945
Pro Leu Tyr Ala Pro Ala Ser Glu His Val Asn Val Leu Leu Thr Arg
940 945 950

ACT GAG GAT AGG CTG GTG TGG AAA ACG CTG GCC GGC GAT CCC TGG ATT 2993
Thr Glu Asp Arg Leu Val Trp Lys Thr Leu Ala Gly Asp Pro Trp Ile
955 960 965

AAG GTC CTA TCA AAC ATT CCA CAG GGT AAC TTT ACG GCC ACA TTG GAA 3041
Lys Val Leu Ser Asn Ile Pro Gln Gly Asn Phe Thr Ala Thr Leu Glu
970 975 980 985

GAA TGG CAA GAA GAA CAC GAC AAA ATA ATG AAG GTG ATT GAA GGA CCG 3089
Glu Trp Gln Glu Glu His Asp Lys Ile Met Lys Val Ile Glu Gly Pro
990 995 1000



[illegible] GCG CCT GTG GAC GCG TTC CAG AAC AAA GCG AAC GTG TGT TGG GCG 3137
Ala Ala Pro Val Asp Ala Phe Gln Asn Lys Ala Asn Val Cys Trp Ala
1005 1010 1015

AAA AGC CTG GTG CCT GTC CTG GAC ACT GCC GGA ATC AGA TTG ACA GCA 3185
Lys Ser Leu Val Pro Val Leu Asp Thr Ala Gly Ile Arg Leu Thr Ala
1020 1025 1030

GAG GAG TGG AGC ACC ATA ATT ACA GCA TTT AAG GAG GAC AGA GCT TAC 3233
Glu Glu Trp Ser Thr Ile Ile Thr Ala Phe Lys Glu Asp Arg Ala Tyr
1035 1040 1045

TCT CCA GTG GTG GCC TTG AAT GAA ATT TGC ACC AAG TAC TAT GGA GTT 3281
Ser Pro Val Val Ala Leu Asn Glu Ile Cys Thr Lys Tyr Tyr Gly Val
1050 1055 1060 1065

GAC CTG GAC AGT GGC CTG TTT TCT GCC CCG AAG GTG TCC CTG TAT TAC 3329 
Asp Leu Asp Ser Gly Leu Phe Ser Ala Pro Lys Val Ser Leu Tyr Tyr 

1070 1075 1080 

GAG AAC AAC CAC TGG GAT AAC AGA CCT GGT GGA AGG ATG TAT GGA TTC 3377 
Glu Asn Asn His Trp Asp Asn Arg Pro Gly Gly Arg Met Tyr Gly Phe 
1085 1090 1095 

AAT GCC GCA ACA GCT GCC AGG CTG GAA GCT AGA CAT ACC TTC CTG AAG 3425
Asn Ala Ala Thr Ala Ala Arg Leu Glu Ala Arg His Thr Phe Leu Lys
1100 1105 1110

GGG CAG TGG CAT ACG GGC AAG CAG GCA GTT ATC GCA GAA AGA AAA ATC 3473
Gly Gln Trp His Thr Gly Lys Gln Ala Val Ile Ala Glu Arg Lys Ile
1115 1120 1125

CAA CCG CTT TCT GTG CTG GAC AAT GTA ATT CCT ATC AAC CGC AGG CTG 3521
Gln Pro Leu Ser Val Leu Asp Asn Val Ile Pro Ile Asn Arg Arg Leu
1130 1135 1140 1145

CCG CAC GCC CTG GTG GCT GAG TAC AAG ACG GTT AAA GGC AGT AGG GTT 3569
Pro His Ala Leu Val Ala Glu Tyr Lys Thr Val Lys Gly Ser Arg Val
1150 1155 1160

GAG TGG CTG GTC AAT AAA GTA AGA GGG TAC CAC GTC CTG CTG GTG AGT 3617
Glu Trp Leu Val Asn Lys Val Arg Gly Tyr His Val Leu Leu Val Ser
1165 1170 1175

GAG TAC AAC CTG GCT TTG CCT CGA CGC AGG GTC ACT TGG TTG TCA CCG 3665
Glu Tyr Asn Leu Ala Leu Pro Arg Arg Arg Val Thr Trp Leu Ser Pro
1180 1185 1190

[illegible] AAT GTC ACA GGC GCC GAT AGG TGC TAC GAC CTA AGT TTA GGA CTG 3713
[illegible] Asn Val Thr Gly Ala Asp Arg Cys Tyr Asp Leu Ser Leu Gly Leu
1195 1200 1205

CCG GCT GAC GCC GGC AGG TTC GAC TTG GTC TTT GTG AAC ATT CAC ACG 3761
Pro Ala Asp Ala Gly Arg Phe Asp Leu Val Phe Val Asn Ile His Thr
1210 1215 1220 1225

GAA TTC AGA ATC CAC CAC TAC CAG CAG TGT GTC GAC CAC GCC ATG AAG 3809
Glu Phe Arg Ile His His Tyr Gln Gln Cys Val Asp His Ala Met Lys
1230 1235 1240

CTG CAG ATG CTT GGG GGA GAT GCG CTA CGA CTG CTA AAA CCC GGC GGC 3857
Leu Gln Met Leu Gly Gly Asp Ala Leu Arg Leu Leu Lys Pro Gly Gly
1245 1250 1255

ATC TTG ATG AGA GCT TAC GGA TAC GCC GAT AAA ATC AGC GAA GCC GTT 3905
Ile Leu Met Arg Ala Tyr Gly Tyr Ala Asp Lys Ile Ser Glu Ala Val
1260 1265 1270

GTT TCC TCC TTA AGC AGA AAG TTC TCG TCT GCA AGA GTG TTG CGC CCG 3953 
Val Ser Ser Leu Ser Arg Lys Phe Ser Ser Ala Arg Val Leu Arg Pro 
1275 1280 1285 

GAT TGT GTC ACC AGC AAT ACA GAA GTG TTC TTG CTG TTC TCC AAC TTT 4001 
Asp Cys Val Thr Ser Asn Thr Glu Val Phe Leu Leu Phe Ser Asn Phe 
1290 1295 1300 1305 

GAC AAC GGA AAG AGA CCC TCT ACG CTA CAC CAG ATG AAT ACC AAG CTG 4049
Asp Asn Gly Lys Arg Pro Ser Thr Leu His Gln Met Asn Thr Lys Leu
1310 1315 1320

AGT GCC GTG TAT GCC GGA GAA GCC ATG CAC ACG GCC GGG TGT GCA CCA 4097
Ser Ala Val Tyr Ala Gly Glu Ala Met His Thr Ala Gly Cys Ala Pro
1325 1330 1335

TCC TAC AGA GTT AAG AGA GCA GAC ATA GCC ACG TGC ACA GAA GCG GCT 4145
Ser Tyr Arg Val Lys Arg Ala Asp Ile Ala Thr Cys Thr Glu Ala Ala
1340 1345 1350

GTG GTT AAC GCA GCT AAC GCC CGT GGA ACT GTA GGG GAT GGC GTA TGC 4193 
Val Val Asn Ala Ala Asn Ala Arg Gly Thr Val Gly Asp Gly Val Cys 
1355 1360 1365 

AGG GCC GTG GCG AAG AAA TGG CCG TCA GCC TTT AAG GGA GCA GCA ACA 4241 
Arg Ala Val Ala Lys Lys Trp Pro Ser Ala Phe Lys Gly Ala Ala Thr 
1370 1375 1380 1385 

CCA GTG GGC ACA ATT AAA ACA GTC ATG TGC GGC TCG TAC CCC GTC ATC 4289
Pro Val Gly Thr Ile Lys Thr Val Met Cys Gly Ser Tyr Pro Val Ile
1390 1395 1400

CAC GCT GTA GCG CCT AAT TTC TCT GCC ACG ACT GAA GCG GAA GGG GAC 4337
His Ala Val Ala Pro Asn Phe Ser Ala Thr Thr Glu Ala Glu Gly Asp
1405 1410 1415

CGC GAA TTG GCC GCT GTC TAC CGG GCA GTG GCC GCC GAA GTA AAC AGA 4385
Arg Glu Leu Ala Ala Val Tyr Arg Ala Val Ala Ala Glu Val Asn Arg
1420 1425 1430

[illegible] TCA CTG AGC AGC GTA GCC ATC CCG CTG CTG TCC ACA GGA GTG TTC 4433
Leu Ser Leu Ser Ser Val Ala Ile Pro Leu Leu Ser Thr Gly Val Phe
1435 1440 1445

[illegible] GGC GGA AGA GAT AGG CTG CAG CAA TCC CTC AAC CAT CTA TTC ACA 4481
Ser Gly Gly Arg Asp Arg Leu Gln Gln Ser Leu Asn His Leu Phe Thr
1450 1455 1460 1465

GCA ATG GAC GCC ACG GAC GCT GAC GTG ACC ATC TAC TGC AGA GAC AAA 4529
Ala Met Asp Ala Thr Asp Ala Asp Val Thr Ile Tyr Cys Arg Asp Lys
1470 1475 1480

AGT TGG GAG AAG AAA ATC CAG GAA GCC ATT GAC ATG AGG ACG GCT GTG 4577
Ser Trp Glu Lys Lys Ile Gln Glu Ala Ile Asp Met Arg Thr Ala Val
1485 1490 1495

GAG TTG CTC AAT GAT GAC GTG GAG CTG ACC ACA GAC TTG GTG AGA GTG 4625 
Glu Leu Leu Asn Asp Asp Val Glu Leu Thr Thr Asp Leu Val Arg Val 
1500 1505 1510 

CAC CCG GAC AGC AGC CTG GTG GGT CGT AAG GGC TAC AGT ACC ACT GAC 4673
His Pro Asp Ser Ser Leu Val Gly Arg Lys Gly Tyr Ser Thr Thr Asp
1515 1520 1525

GGG TCG CTG TAC TCG TAC TTT GAA GGT ACG AAA TTC AAC CAG GCT GCT 4721
Gly Ser Leu Tyr Ser Tyr Phe Glu Gly Thr Lys Phe Asn Gln Ala Ala
1530 1535 1540 1545

ATT GAT ATG GCA GAG ATA CTG ACG TTG TGG CCC AGA CTG CAA GAG GCA 4769
Ile Asp Met Ala Glu Ile Leu Thr Leu Trp Pro Arg Leu Gln Glu Ala
1550 1555 1560

AAC GAA CAG ATA TGC CTA TAC GCG CTG GGC GAA ACA ATG GAC AAC ATC 4817
Asn Glu Gln Ile Cys Leu Tyr Ala Leu Gly Glu Thr Met Asp Asn Ile
1565 1570 1575

AGA TCC AAA TGT CCG GTG AAC GAT TCC GAT TCA TCA ACA CCT CCC AGG 4865
Arg Ser Lys Cys Pro Val Asn Asp Ser Asp Ser Ser Thr Pro Pro Arg
1580 1585 1590

ACA GTG CCC TGC CTG TGC CGC TAC GCA ATG ACA GCA GAA CGG ATC GCC 4913
Thr Val Pro Cys Leu Cys Arg Tyr Ala Met Thr Ala Glu Arg Ile Ala
1595 1600 1605

CGC CTT AGG TCA CAC CAA GTT AAA AGC ATG GTG GTT TGC TCA TCT TTT 4961
Arg Leu Arg Ser His Gln Val Lys Ser Met Val Val Cys Ser Ser Phe
1610 1615 1620 1625

CCC CTC CCG AAA TAC CAT GTA GAT GGG GTG CAG AAG GTA AAG TGC GAG 5009
Pro Leu Pro Lys Tyr His Val Asp Gly Val Gln Lys Val Lys Cys Glu
1630 1635 1640

[illegible] GTT CTC CTG TTC GAC CCG ACG GTA CCT TCA GTG GTT AGT CCG CGG 5057
Lys Val Leu Leu Phe Asp Pro Thr Val Pro Ser Val Val Ser Pro Arg
1645 1650 1655

AAG TAT GCC GCA TCT ACG ACG GAC CAC TCA GAT CGG TCG TTA CGA GGG 5105
Lys Tyr Ala Ala Ser Thr Thr Asp His Ser Asp Arg Ser Leu Arg Gly
1660 1665 1670

TTT GAC TTG GAC TGG ACC ACC GAC TCG TCT TCC ACT GCC AGC GAT ACC 5153 
Phe Asp Leu Asp Trp Thr Thr Asp Ser Ser Ser Thr Ala Ser Asp Thr 
1675 1680 1685 

ATG TCG CTA CCC AGT TTG CAG TCG TGT GAC ATC GAC TCG ATC TAC GAG 5201
Met Ser Leu Pro Ser Leu Gln Ser Cys Asp Ile Asp Ser Ile Tyr Glu
1690 1695 1700 1705

CCA ATG GCT CCC ATA GTA GTG ACG GCT GAC GTA CAC CCT GAA CCC GCA 5249
Pro Met Ala Pro Ile Val Val Thr Ala Asp Val His Pro Glu Pro Ala
1710 1715 1720

GGC ATC GCG GAC CTG GCG GCA GAT GTG CAC CCT GAA CCC GCA GAC CAT 5297
Gly Ile Ala Asp Leu Ala Ala Asp Val His Pro Glu Pro Ala Asp His
1725 1730 1735

GTG GAC CTC GAG AAC CCG ATT CCT CCA CCG CGC CCG AAG AGA GCT GCA 5345
Val Asp Leu Glu Asn Pro Ile Pro Pro Pro Arg Pro Lys Arg Ala Ala
1740 1745 1750

TAC CTT GCC TCC CGC GCG GCG GAG CGA CCG GTG CCG GCG CCG AGA AAG 5393
Tyr Leu Ala Ser Arg Ala Ala Glu Arg Pro Val Pro Ala Pro Arg Lys
1755 1760 1765

CCG ACG CCT GCC CCA AGG ACT GCG TTT AGG AAC AAG CTG CCT TTG ACG 5441 
Pro Thr Pro Ala Pro Arg Thr Ala Phe Arg Asn Lys Leu Pro Leu Thr 
1770 1775 1780 1785 

TTC GGC GAC TTT GAC GAG CAC GAG GTC GAT GCG TTG GCC TCC GGG ATT 5489
Phe Gly Asp Phe Asp Glu His Glu Val Asp Ala Leu Ala Ser Gly Ile
1790 1795 1800

ACT TTC GGA GAC TTC GAC GAC GTC CTG CGA CTA GGC CGC GCG GGT GCA 5537
Thr Phe Gly Asp Phe Asp Asp Val Leu Arg Leu Gly Arg Ala Gly Ala
1805 1810 1815

TAT ATT TTC TCC TCG GAC ACT GGC AGC GGA CAT TTA CAA CAA AAA TCC 5585
Tyr Ile Phe Ser Ser Asp Thr Gly Ser Gly His Leu Gln Gln Lys Ser
1820 1825 1830

GTT AGG CAG CAC AAT CTC CAG TGC GCA CAA CTG GAT GCG GTC CAG GAG 5633
Val Arg Gln His Asn Leu Gln Cys Ala Gln Leu Asp Ala Val Gln Glu
1835 1840 1845

GAG AAA ATG TAC CCG CCA AAA TTG GAT ACT GAG AGG GAG AAG CTG TTG 5681
Glu Lys Met Tyr Pro Pro Lys Leu Asp Thr Glu Arg Glu Lys Leu Leu
1850 1855 1860 1865

[?]TG CTG AAA ATG CAG ATG CAC CCA TCG GAG GCT AAT AAG AGT CGA TAC 5729
Leu Leu Lys Met Gln Met His Pro Ser Glu Ala Asn Lys Ser Arg Tyr
1870 1875 1880

CAG TCT CGC AAA GTG GAG AAC ATG AAA GCC ACG GTG GTG GAC AGG CTC 5777
Gln Ser Arg Lys Val Glu Asn Met Lys Ala Thr Val Val Asp Arg Leu
1885 1890 1895

ACA TCG GGG GCC AGA TTG TAC ACG GGA GCG GAC GTA GGC CGC ATA CCA 5825
Thr Ser Gly Ala Arg Leu Tyr Thr Gly Ala Asp Val Gly Arg Ile Pro
1900 1905 1910

ACA TAC GCG GTT CGG TAC CCC CGC CCC GTG TAC TCC CCT ACC GTG ATC 5873
Thr Tyr Ala Val Arg Tyr Pro Arg Pro Val Tyr Ser Pro Thr Val Ile
1915 1920 1925

GAA AGA TTC TCA AGC CCC GAT GTA GCA ATC GCA GCG TGC AAC GAA TAC 5921
Glu Arg Phe Ser Ser Pro Asp Val Ala Ile Ala Ala Cys Asn Glu Tyr
1930 1935 1940 1945

CTA TCC AGA AAT TAC CCA ACA GTG GCG TCG TAC CAG ATA ACA GAT GAA 5969
Leu Ser Arg Asn Tyr Pro Thr Val Ala Ser Tyr Gln Ile Thr Asp Glu
1950 1955 1960

TAC GAC GCA TAC TTG GAC ATG GTT GAC GGG TCG GAT AGT TGC TTG GAC 6017
Tyr Asp Ala Tyr Leu Asp Met Val Asp Gly Ser Asp Ser Cys Leu Asp
1965 1970 1975

AGA GCG ACA TTC TGC CCG GCG AAG CTC CGG TGC TAC CCG AAA CAT CAT 6065
Arg Ala Thr Phe Cys Pro Ala Lys Leu Arg Cys Tyr Pro Lys His His
1980 1985 1990

GCG TAC CAC CAG CCG ACT GTA CGC AGT GCC GTC CCG TCA CCC TTT CAG 6113
Ala Tyr His Gln Pro Thr Val Arg Ser Ala Val Pro Ser Pro Phe Gln
1995 2000 2005

AAC ACA CTA CAG AAC GTG CTA GCG GCC GCC ACC AAG AGA AAC TGC AAC 6161
Asn Thr Leu Gln Asn Val Leu Ala Ala Ala Thr Lys Arg Asn Cys Asn
2010 2015 2020 2025

GT[?] ACG CAA ATG CGA GAA CTA CCC ACC ATG GAC TCG GCA GTG TTC AAC 6209
Val Thr Gln Met Arg Glu Leu Pro Thr Met Asp Ser Ala Val Phe Asn
2030 2035 2040

GTG GAG TGC TTC AAG CGC TAT GCC TGC TCC GGA GAA TAT TGG GAA GAA 6257
Val Glu Cys Phe Lys Arg Tyr Ala Cys Ser Gly Glu Tyr Trp Glu Glu
2045 2050 2055

TAT GCT AAA CAA CCT ATC CGG ATA ACC ACT GAG AAC ATC ACT ACC TAT 6305
Tyr Ala Lys Gln Pro Ile Arg Ile Thr Thr Glu Asn Ile Thr Thr Tyr
2060 2065 2070

GTG ACC AAA TTG AAA GGC CCG AAA GCT GCT GCC TTG TTC GCT AAG ACC 6353
Val Thr Lys Leu Lys Gly Pro Lys Ala Ala Ala Leu Phe Ala Lys Thr
2075 2080 2085

CAC AAC TTG GTT CCG CTG CAG GAG GTT CCC ATG GAC AGA TTC ACG GTC 6401
His Asn Leu Val Pro Leu Gln Glu Val Pro Met Asp Arg Phe Thr Val
2090 2095 2100 2105

GAC ATG AAA CGA GAT GTC AAA GTC ACT CCA GGG ACG AAA CAC ACA GAG 6449
Asp Met Lys Arg Asp Val Lys Val Thr Pro Gly Thr Lys His Thr Glu
2110 2115 2120

GAA AGA CCC AAA GTC CAG GTA ATT CAA GCA GCG GAG CCA TTG GCG ACC 6497
Glu Arg Pro Lys Val Gln Val Ile Gln Ala Ala Glu Pro Leu Ala Thr
2125 2130 2135

GCT TAC CTG TGC GGC ATC CAC AGG GAA TTA GTA AGG AGA CTA AAT GCT 6545
Ala Tyr Leu Cys Gly Ile His Arg Glu Leu Val Arg Arg Leu Asn Ala
2140 2145 2150

GTG TTA CGC CCT AAC GTG CAC ACA TTG TTT GAT ATG TCG GCC GAA GAC 6593
Val Leu Arg Pro Asn Val His Thr Leu Phe Asp Met Ser Ala Glu Asp
2155 2160 2165

TTT GAC GCG ATC ATC GCC TCT CAC TTC CAC CCA GGA GAC CCG GTT CTA 6641
Phe Asp Ala Ile Ile Ala Ser His Phe His Pro Gly Asp Pro Val Leu
2170 2175 2180 2185

GAG ACG GAC ATT GCA TCA TTC GAC AAA AGC CAG GAC GAC TCC TTG GCT 6689
Glu Thr Asp Ile Ala Ser Phe Asp Lys Ser Gln Asp Asp Ser Leu Ala
2190 2195 2200

CTT ACA GGT TTA ATG ATC CTC GAA GAT CTA GGG GTG GAT CAG TAC CTG 6737
Leu Thr Gly Leu Met Ile Leu Glu Asp Leu Gly Val Asp Gln Tyr Leu
2205 2210 2215

CTG GAC TTG ATC GAG GCA GCC TTT GGG GAA ATA TCC AGC TGT CAC CTA 6785
Leu Asp Leu Ile Glu Ala Ala Phe Gly Glu Ile Ser Ser Cys His Leu
2220 2225 2230

CCA ACT GGC ACG CGC TTC AAG TTC GGA GCT ATG ATG AAA TCG GGC ATG 6833
Pro Thr Gly Thr Arg Phe Lys Phe Gly Ala Met Met Lys Ser Gly Met
2235 2240 2245

TTT CTG ACT TTG TTT ATT AAC ACT GTT TTG AAC ATC ACC ATA GCA AGC 6881
Phe Leu Thr Leu Phe Ile Asn Thr Val Leu Asn Ile Thr Ile Ala Ser
2250 2255 2260 2265

AGG GTA CTG GAG CAG AGA CTC ACT GAC TCC GCC TGT GCG GCC TTC ATC 6929
Arg Val Leu Glu Gln Arg Leu Thr Asp Ser Ala Cys Ala Ala Phe Ile
2270 2275 2280

GGC GAC GAC AAC ATC GTT CAC GGA GTG ATC TCC GAC AAG CTG ATG GCG 6977
Gly Asp Asp Asn Ile Val His Gly Val Ile Ser Asp Lys Leu Met Ala
2285 2290 2295

GAG AGG TGC GCG TCG TGG GTC AAC ATG GAG GTG AAG ATC ATT GAC GCT 7025
Glu Arg Cys Ala Ser Trp Val Asn Met Glu Val Lys Ile Ile Asp Ala
2300 2305 2310

GTC ATG GGC GAA AAA CCC CCA TAT TTT TGT GGG GGA TTC ATA GTT TTT 7073
Val Met Gly Glu Lys Pro Pro Tyr Phe Cys Gly Gly Phe Ile Val Phe
2315 2320 2325

GAC AGC GTC ACA CAG ACC GCC TGC CGT GTT TCA GAC CCA CTT AAG CGC 7121
Asp Ser Val Thr Gln Thr Ala Cys Arg Val Ser Asp Pro Leu Lys Arg
2330 2335 2340 2345

CTG TTC AAG TTG GGT AAG CCG CTA ACA GCT GAA GAC AAG CAG GAC GAA 7169
Leu Phe Lys Leu Gly Lys Pro Leu Thr Ala Glu Asp Lys Gln Asp Glu
2350 2355 2360

GAC AGG CGA CGA GCA CTG AGT GAC GAG GTT AGC AAG TGG TTC CGG ACA 7217
Asp Arg Arg Arg Ala Leu Ser Asp Glu Val Ser Lys Trp Phe Arg Thr
2365 2370 2375

GGC TTG GGG GCC GAA CTG GAG GTG GCA CTA ACA TCT AGG TAT GAG GTA 7265
Gly Leu Gly Ala Glu Leu Glu Val Ala Leu Thr Ser Arg Tyr Glu Val
2380 2385 2390

GAG GGC TGC AAA AGT ATC CTC ATA GCC ATG ACC ACC TTG GCG AGG GAC 7313
Glu Gly Cys Lys Ser Ile Leu Ile Ala Met Thr Thr Leu Ala Arg Asp
2395 2400 2405

ATT AAG GCG TTT AAG AAA TTG AGA GGA CCT GTT ATA CAC CTC TAC GGC 7361
Ile Lys Ala Phe Lys Lys Leu Arg Gly Pro Val Ile His Leu Tyr Gly
2410 2415 2420 2425

GGT CCT AGA TTG GTG CGT TAATACACAG AATTCTGATT ATAGCGCACT 7409
Gly Pro Arg Leu Val Arg
2430

[?]TTATAGCAC C ATG AAT TAC ATC CCT ACG CAA ACG TTT TAC GGC CGC CGG 7459
Met Asn Tyr Ile Pro Thr Gln Thr Phe Tyr Gly Arg Arg
1 5 10

TGG CGC CCG CGC CCG GCG GCC CGT CCT TGG CCG TTG CAG GCC ACT CCG 7507
Trp Arg Pro Arg Pro Ala Ala Arg Pro Trp Pro Leu Gln Ala Thr Pro
15 20 25

GTG GCT CCC GTC GTC CCC GAC TTC CAG GCC CAG CAG ATG CAG CAA CTC 7555
Val Ala Pro Val Val Pro Asp Phe Gln Ala Gln Gln Met Gln Gln Leu
30 35 40 45

ATC AGC GCC GTA AAT GCG CTG ACA ATG AGA CAG AAC GCA ATT GCT CCT 7603
Ile Ser Ala Val Asn Ala Leu Thr Met Arg Gln Asn Ala Ile Ala Pro
50 55 60

GCT AGG CCT CCC AAA CCA AAG AAG AAG AAG ACA ACC AAA CCA AAG CCG 7651
Ala Arg Pro Pro Lys Pro Lys Lys Lys Lys Thr Thr Lys Pro Lys Pro
65 70 75

AAA ACG CAG CCC AAG AAG ATC AAC GGA AAA ACG CAG CAG CAA AAG AAG 7699
Lys Thr Gln Pro Lys Lys Ile Asn Gly Lys Thr Gln Gln Gln Lys Lys
80 85 90

AAA GAC AAG CAA GCC GAC AAG AAG AAG AAG AAA CCC GGA AAA AGA GAA 7747
Lys Asp Lys Gln Ala Asp Lys Lys Lys Lys Lys Pro Gly Lys Arg Glu
95 100 105

AGA ATG TGC ATG AAG ATT GAA AAT GAC TGT ATC TTC GAA GTC AAA CAC 7795
Arg Met Cys Met Lys Ile Glu Asn Asp Cys Ile Phe Glu Val Lys His
110 115 120 125

GAA GGA AAG GTC ACT GGG TAC GCC TGC CTG GTG GGC GAC AAA GTC ATG 7843
Glu Gly Lys Val Thr Gly Tyr Ala Cys Leu Val Gly Asp Lys Val Met
130 135 140

AAA CCT GCC CAC GTG AAA GGA GTC ATC GAC AAC GCG GAC CTG GCA AAG 7891
Lys Pro Ala His Val Lys Gly Val Ile Asp Asn Ala Asp Leu Ala Lys
145 150 155

CTA GCT TTC AAG AAA TCG AGC AAG TAT GAC CTT GAG TGT GCC CAG ATA 7939
Leu Ala Phe Lys Lys Ser Ser Lys Tyr Asp Leu Glu Cys Ala Gln Ile
160 165 170

CCA GTT CAC ATG AGG TCG GAT GCC TCA AAG TAC ACG CAT GAG AAG CCC 7987
Pro Val His Met Arg Ser Asp Ala Ser Lys Tyr Thr His Glu Lys Pro
175 180 185

GAG GGA CAC TAT AAC TGG CAC CAC GGG GCT GTT CAG TAC AGC GGA GGT 8035
Glu Gly His Tyr Asn Trp His His Gly Ala Val Gln Tyr Ser Gly Gly
190 195 200 205

AGG TTC ACT ATA CCG ACA GGA GCG GGC AAA CCG GGA GAC AGT GGC CGG 8083
Arg Phe Thr Ile Pro Thr Gly Ala Gly Lys Pro Gly Asp Ser Gly Arg
210 215 220

CCC ATC TTT GAC AAC AAG GGG AGG GTA GTC GCT ATC GTC CTG GGC GGG 8131
Pro Ile Phe Asp Asn Lys Gly Arg Val Val Ala Ile Val Leu Gly Gly
225 230 235

GCC AAC GAG GGC TCA CGC ACA GCA CTG TCG GTG GTC ACC TGG AAC AAA 8179
Ala Asn Glu Gly Ser Arg Thr Ala Leu Ser Val Val Thr Trp Asn Lys
240 245 250

GAT ATG GTG ACT AGA GTG ACC CCC GAG GGG TCC GAA GAG TGG TCC GCC 8227
Asp Met Val Thr Arg Val Thr Pro Glu Gly Ser Glu Glu Trp Ser Ala
255 260 265

CCG CTG ATT ACT GCC ATG TGT GTC CTT GCC AAT GCT ACC TTC CCG TGC 8275
Pro Leu Ile Thr Ala Met Cys Val Leu Ala Asn Ala Thr Phe Pro Cys
270 275 280 285

TTC CAG CCC CCG TGT GTA CCT TGC TGC TAT GAA AAC AAC GCA GAG GCC 8323
Phe Gln Pro Pro Cys Val Pro Cys Cys Tyr Glu Asn Asn Ala Glu Ala
290 295 300

ACA CTA CGG ATG CTC GAG GAT AAC GTG GAT AGG CCA GGG TAC TAC GAC 8371 
Thr Leu Arg Met Leu Glu Asp Asn Val Asp Arg Pro Gly Tyr Tyr Asp 
305 310 315 

CTC CTT CAG GCA GCC TTG ACG TGC CGA AAC GGA ACA AGA CAC CGG CGC 8419
Leu Leu Gln Ala Ala Leu Thr Cys Arg Asn Gly Thr Arg His Arg Arg
320 325 330

AGC GTG TCG CAA CAC TTC AAC GTG TAT AAG GCT ACA CGC CCT TAC ATC 8467
Ser Val Ser Gln His Phe Asn Val Tyr Lys Ala Thr Arg Pro Tyr Ile
335 340 345

GCG TAC TGC GCC GAC TGC GGA GCA GGG CAC TCG TGT CAT AGC CCC GTA 8515
Ala Tyr Cys Ala Asp Cys Gly Ala Gly His Ser Cys His Ser Pro Val
350 355 360 365

GCA ATT GAA GCG GTC AGG TCC GAA GCT ACC GAC GGG ATG CTG AAG ATT 8563
Ala Ile Glu Ala Val Arg Ser Glu Ala Thr Asp Gly Met Leu Lys Ile
370 375 380

CAG TTC TCG GCA CAA ATT GGC ATA GAT AAG AGT GAC AAT CAT GAC TAC 8611
Gln Phe Ser Ala Gln Ile Gly Ile Asp Lys Ser Asp Asn His Asp Tyr
385 390 395

ACG AAG ATA AGG TAC GCA GAC GGG CAC GCC ATT GAG AAT GCC GTC CGG 8659
Thr Lys Ile Arg Tyr Ala Asp Gly His Ala Ile Glu Asn Ala Val Arg
400 405 410

[illegible] TCT TTG AAG GTA GCC ACC TCC GGA GAC TGT TTC GTC CAT GGC ACA 8707
[illegible] Ser Leu Lys Val Ala Thr Ser Gly Asp Cys Phe Val His Gly Thr
415 420 425

ATG GGA CAT TTC ATA CTG GCA AAG TGC CCA CCG GGT GAA TTC CTG CAG 8755
Met Gly His Phe Ile Leu Ala Lys Cys Pro Pro Gly Glu Phe Leu Gln
430 435 440 445

GTC TCG ATC CAG GAC ACC AGA AAC GCG GTC CGT GCC TGC AGA ATA CAA 8803
Val Ser Ile Gln Asp Thr Arg Asn Ala Val Arg Ala Cys Arg Ile Gln
450 455 460

TA[?] CAT CAT GAC CCT CAA CCG GTG GGT AGA GAA AAA TTT ACA ATT AGA 8851
Tyr His His Asp Pro Gln Pro Val Gly Arg Glu Lys Phe Thr Ile Arg
465 470 475

CCA CAC TAT GGA AAA GAG ATC CCT TGC ACC ACT TAT CAA CAG ACC ACA 8899
Pro His Tyr Gly Lys Glu Ile Pro Cys Thr Thr Tyr Gln Gln Thr Thr
480 485 490

GCG AAG ACC GTG GAG GAA ATC GAC ATG CAT ATG CCG CCA GAT ACG CCG 8947
Ala Lys Thr Val Glu Glu Ile Asp Met His Met Pro Pro Asp Thr Pro
495 500 505

GAC AGG ACG TTG CTA TCA CAG CAA TCT GGC AAT GTA AAG ATC ACA GTC 8995
Asp Arg Thr Leu Leu Ser Gln Gln Ser Gly Asn Val Lys Ile Thr Val
510 515 520 525

GGA GGA AAG AAG GTG AAA TAC AAC TGC ACC TGT GGA ACC GGA AAC GTT 9043 
Gly Gly Lys Lys Val Lys Tyr Asn Cys Thr Cys Gly Thr Gly Asn Val 

530 535 540

GGC ACT ACT AAT TCG GAC ATG ACG ATC AAC ACG TGT CTA ATA GAG CAG 9091
Gly Thr Thr Asn Ser Asp Met Thr Ile Asn Thr Cys Leu Ile Glu Gln
545 550 555

TGC CAC GTC TCA GTG ACG GAC CAT AAG AAA TGG CAG TTC AAC TCA CCT 9139
Cys His Val Ser Val Thr Asp His Lys Lys Trp Gln Phe Asn Ser Pro
560 565 570

TTC GTC CCG AGA GCC GAC GAA CCG GCT AGA AAA GGC AAA GTC CAT ATC 9187 
Phe Val Pro Arg Ala Asp Glu Pro Ala Arg Lys Gly Lys Val His Ile
575 580 585 

CCA TTC CCG TTG GAC AAC ATC ACA TGC AGA GTT CCA ATG GCG CGC GAA 9235
Pro Phe Pro Leu Asp Asn Ile Thr Cys Arg Val Pro Met Ala Arg Glu
590 595 600 605 

CCA ACC GTC ATC CAC GGC AAA AGA GAA GTG ACA CTG CAC CTT CAC CCA 9283
Pro Thr Val Ile His Gly Lys Arg Glu Val Thr Leu His Leu His Pro
610 615 620

GAT CAT CCC ACG CTC TTT TCC TAC CGC ACA CTG GGT GAG GAC CCG CAG 9331
Asp His Pro Thr Leu Phe Ser Tyr Arg Thr Leu Gly Glu Asp Pro Gln
625 630 635 

CAC GAG GAA TGG GTG ACA GCG GCG GTG GAA CGG ACC ATA CCC GTA 9379 
His Glu Glu Trp Val Thr Ala Ala Val Glu Arg Thr Ile Pro Val
640 645 650

CC?K GTG GAC GGG ATG GAG TAC CAC TGG GGA AAC AAC GAC CCA GTG AGG 9427 
Pro Val Asp Gly Met Glu Tyr His Trp Gly Asn Asn Asp Pro Val Arg
655 660 665 



TGG TCT CAA CTC ACC ACT GAA GGG AAA CCG CAC GGC TGG CCG CAT 9475 
Leu Trp Ser Gln Leu Thr Thr Glu Gly Lys Pro His Gly Trp Pro His
670 675 680 685 

CAG ATC GTA CAG TAC TAC TAT GGG CTT TAC CCG GCC GCT ACA GTA TCC 9523 
Gln Ile Val Gln Tyr Tyr Tyr Gly Leu Tyr Pro Ala Ala Thr Val Ser

690 695 700 

GCG GTC GTC GGG ATG AGC TTA CTG GCG TTG ATA TCG ATC TTC GCG TCG 9571
Ala Val Val Gly Met Ser Leu Leu Ala Leu Ile Ser Ile Phe Ala Ser
705 710 715 

TGC TAC ATG CTG GTT GCG GCC CGC AGT AAG TGC TTG ACC CCT TAT GCT 9619 
Cys Tyr Met Leu Val Ala Ala Arg Ser Lys Cys Leu Thr Pro Tyr Ala 
720 725 730 

TTA ACA CCA GGA GCT GCA GTT CCG TGG ACG CTG GGG ATA CTC TGC TGC 9667
Leu Thr Pro Gly Ala Ala Val Pro Trp Thr Leu Gly Ile Leu Cys Cys
735 740 745

GCC CCG CGG GCG CAC GCA GCT AGT GTG GCA GAG ACT ATG GCC TAC TTG 9715
Ala Pro Arg Ala His Ala Ala Ser Val Ala Glu Thr Met Ala Tyr Leu 
750 755 760 765 

TGG GAC CAA AAC CAA GCG TTG TTC TGG TTG GAG TTT GCG GCC CCT GTT 9763 
Trp Asp Gln Asn Gln Ala Leu Phe Trp Leu Glu Phe Ala Ala Pro Val

770 775 780 

GCC TGC ATC CTC ATC ATC ACG TAT TGC CTC AGA AAC GTG CTG TGT TGC 9811 
Ala Cys Ile Leu Ile Ile Thr Tyr Cys Leu Arg Asn Val Leu Cys Cys
785 790 795 

TGT AAG AGC CTT TCT TTT TTA GTG CTA CTG AGC CTC GGG GCA ACC GCC 9859 
Cys Lys Ser Leu Ser Phe Leu Val Leu Leu Ser Leu Gly Ala Thr Ala 
800 805 810 

AGA GCT TAC GAA CAT TCG ACA GTA ATG CCG AAC GTG GTG GGG TTC CCG 9907 
Arg Ala Tyr Glu His Ser Thr Val Met Pro Asn Val Val Gly Phe Pro 
815 820 825



TAT AAG GCT CAC ATT GAA AGG CCA GGA TAT AGC CCC CTC ACT TTG CAG 9955
Tyr Lys Ala His Ile Glu Arg Pro Gly Tyr Ser Pro Leu Thr Leu Gln
830 835 840 845

ATG CAG GTT GTT GAA ACC AGC CTC GAA CCA ACC CTT AAT TTG GAA TAC 10003
Met Gln Val Val Glu Thr Ser Leu Glu Pro Thr Leu Asn Leu Glu Tyr
850 855 860

ATA ACC TGT GAG TAC AAG ACG GTC GTC CCG TCG CCG TAC GTG AAG TGC 10051
Ile Thr Cys Glu Tyr Lys Thr Val Val Pro Ser Pro Tyr Val Lys Cys
865 870 875

TGC GGC GCC TCA GAG TGC TCC ACT AAA GAG AAG CCT GAC TAC CAA TGC 10099
Cys Gly Ala Ser Glu Cys Ser Thr Lys Glu Lys Pro Asp Tyr Gln Cys
880 885 890 

AAG GTT TAC ACA GGC GTG TAC CCG TTC ATG TGG GGA GGG GCA TAT TGC 10147 
Lys Val Tyr Thr Gly Val Tyr Pro Phe Met Trp Gly Gly Ala Tyr Cys 
895 900 905 

TTC TGC GAC TCA GAA AAC ACG CAA CTC AGC GAG GCG TAC GTC GAT CGA 10195 
Phe Cys Asp Ser Glu Asn Thr Gln Leu Ser Glu Ala Tyr Val Asp Arg
910 915 920 925 

TCG GAC GTA TGC AGG CAT GAT CAC GCA TCT GCT TAC AAA GCC CAT ACA 10243 
Ser Asp Val Cys Arg His Asp His Ala Ser Ala Tyr Lys Ala His Thr 

930 935 940 

GCA TCG CTG AAG GCC AAA GTG AGG GTT ATG TAC GGC AAC GTA AAC CAG 10291
Ala Ser Leu Lys Ala Lys Val Arg Val Met Tyr Gly Asn Val Asn Gln
945 950 955

ACT GTG GAT GTT TAC GTG AAC GGA GAC CAT GCC GTC ACG ATA GGG GGT 10339
Thr Val Asp Val Tyr Val Asn Gly Asp His Ala Val Thr Ile Gly Gly
960 965 970 

ACT CAG TTC ATA TTC GGG CCG CTG TCA TCG GCC TGG ACC CCG TTC GAC 10387 
Thr Gln Phe Ile Phe Gly Pro Leu Ser Ser Ala Trp Thr Pro Phe Asp
975 980 985

AAC AAG ATA GTC GTG TAC AAA GAC GAA GTG TTC AAT CAG GAC TTC CCG 10435
Asn Lys Ile Val Val Tyr Lys Asp Glu Val Phe Asn Gln Asp Phe Pro
990 995 1000 1005 

CCG TAC GGA TCT GGG CAA CCA GGG CGC TTC GGC GAC ATC CAA AGC AGA 10483 
Pro Tyr Gly Ser Gly Gln Pro Gly Arg Phe Gly Asp Ile Gln Ser Arg

1010 1015 1020 

ACA GTG GAG AGT AAC GAC CTG TAC GCG AAC ACG GCA CTG AAG CTG GCA 10531 
Thr Val Glu Ser Asn Asp Leu Tyr Ala Asn Thr Ala Leu Lys Leu Ala 
1025 1030 1035 

CGT CCT TCA CCC GGC ATG GTC CAT GTA CCG TAC ACA CAG ACA CCT TCA 10579
Arg Pro Ser Pro Gly Met Val His Val Pro Tyr Thr Gln Thr Pro Ser
1040 1045 1050

Gfe TTC AAA TAT TGG CTA AAG GAA AAA GGG ACA GCC CTA AAT ACG AAG 10627 
Gly Phe Lys Tyr Trp Leu Lys Glu Lys Gly Thr Ala Leu Asn Thr Lys 
1055 1060 1065 

GCT CCT TTT GGC TGC CAA ATC AAA ACG AAC CCT GTC AGG GCC ATG AAC 10675
Ala Pro Phe Gly Cys Gln Ile Lys Thr Asn Pro Val Arg Ala Met Asn
1070 1075 1080 1085



TGC GCC GTG GGA AAC ATC CCT GTC TCC ATG AAT TTG CCT GAC AGC GCC 10723 
Cys Ala Val Gly Asn Ile Pro Val Ser Met Asn Leu Pro Asp Ser Ala

1090 1095 1100 

TTT ACC CGC ATT GTC GAG GCG CCG ACC ATC ATT GAC CTG ACT TGC ACA 10771 
Phe Thr Arg Ile Val Glu Ala Pro Thr Ile Ile Asp Leu Thr Cys Thr
1105 1110 1115

GTG GCT ACC TGT ACG CAC TCC TCG GAT TTC GGC GGC GTC TTG ACA CTG 10819
Val Ala Thr Cys Thr His Ser Ser Asp Phe Gly Gly Val Leu Thr Leu
1120 1125 1130

ACG TAC AAG ACC AAC AAG AAC GGG GAC TGC TCT GTA CAC TCG CAC TCT 10867 
Thr Tyr Lys Thr Asn Lys Asn Gly Asp Cys Ser Val His Ser His Ser 
1135 1140 1145 

AAC GTA GCT ACT CTA CAG GAG GCC ACA GCA AAA GTG AAG ACA GCA GGT 10915 
Asn Val Ala Thr Leu Gln Glu Ala Thr Ala Lys Val Lys Thr Ala Gly
1150 1155 1160 1165

AAG GTG ACC TTA CAC TTC TCC ACG GCA AGC GCA TCA CCT TCT TTT GTG 10963
Lys Val Thr Leu His Phe Ser Thr Ala Ser Ala Ser Pro Ser Phe Val 

1170 1175 1180 

GTG TCG CTA TGC AGT GCT AGG GCC ACC TGT TCA GCG TCG TGT GAG CCC 11011 
Val Ser Leu Cys Ser Ala Arg Ala Thr Cys Ser Ala Ser Cys Glu Pro 
1185 1190 1195

CCG AAA GAC CAC ATA GTC CCA TAT GCG GCT AGC CAC AGT AAC GTA GTG 11059 
Pro Lys Asp His Ile Val Pro Tyr Ala Ala Ser His Ser Asn Val Val
1200 1205 1210 

TTT CCA GAC ATG TCG GGC ACC GCA CTA TCA TGG GTG CAG AAA ATC TCG 11107 
Phe Pro Asp Met Ser Gly Thr Ala Leu Ser Trp Val Gln Lys Ile Ser
1215 1220 1225 

GGT GGT CTG GGG GCC TTC GCA ATC GGC GCT ATC CTG GTG CTG GTT GTG 11155 
Gly Gly Leu Gly Ala Phe Ala Ile Gly Ala Ile Leu Val Leu Val Val
1230 1235 1240 1245

GTC ACT TGC ATT GGG CTC CGC AGA TAAGTTAGGG TAGGCAATGG CATTGATATA 11209
Val Thr Cys Ile Gly Leu Arg Arg
1250

HpCAAGAAAAT TGAAAACAGA AAAAGTTAGG GTAAGCAATG GCATATAACC ATAACTGTAT 11269 

IaACTTGTAAC AAAGCGCAAC AAGACCTGCG CAATTGGCCC CGTGGTCCGC CTCACGGAAA 11329 

*JTCGGGGCAA CTCATATTGA CACATTAATT GGCAATAATT GGAAGCTTAC ATAAGCTTAA 11389 

pfTCGACGAAT AATTGGATTT TTATTTTATT TTGCAATTGG TTTTTAATAT TTCCAAAAAA 11449 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 11509

AAAACTAG 11517



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Ala Ala Lys Val His Val Asp Ile Glu Ala Asp Ser Pro Phe Ile
1 5 10 15

Lys Ser Leu Gln Lys Ala Phe Pro Ser Phe Glu Val Glu Ser Leu Gln
20 25 30

Val Thr Pro Asn Asp His Ala Asn Ala Arg Ala Phe Ser His Leu Ala
35 40 45

Thr Lys Leu Ile Glu Gln Glu Thr Asp Lys Asp Thr Leu Ile Leu Asp
50 55 60

Ile Gly Ser Ala Pro Ser Arg Arg Met Met Ser Thr His Lys Tyr His
65 70 75 80

Cys Val Cys Pro Met Arg Ser Ala Glu Asp Pro Glu Arg Leu Asp Ser
85 90 95

Tyr Ala Lys Lys Leu Ala Ala Ala Ser Gly Lys Val Leu Asp Arg Glu
100 105 110

Ile Ala Gly Lys Ile Thr Asp Leu Gln Thr Val Met Ala Thr Pro Asp
115 120 125

Ajk Glu Ser Pro Thr Phe Cys Leu His Thr Asp Val Thr Cys Arg Thr
130 135 140

Atei Ala Glu Val Ala Val Tyr Gln Asp Val Tyr Ala Val His Ala Pro
145 150 155 160

Thr Ser Leu Tyr His Gln Ala Met Lys Gly Val Arg Thr Ala Tyr Trp
165 170 175

Ile Gly Phe Asp Thr Thr Pro Phe Met Phe Asp Ala Leu Ala Gly Ala
180 185 190

T||r Pro Thr Tyr Ala Thr Asn Trp Ala Asp Glu Gln Val Leu Gln Ala
195 200 205

Arg Asn Ile Gly Leu Cys Ala Ala Ser Leu Thr Glu Gly Arg Leu Gly
210 215 220

Lys Leu Ser Ile Leu Arg Lys Lys Gln Leu Lys Pro Cys Asp Thr Val
225 230 235 240

Met Phe Ser Val Gly Ser Thr Leu Tyr Thr Glu Ser Arg Lys Leu Leu
245 250 255

Arg Ser Trp His Leu Pro Ser Val Phe His Leu Lys Gly Lys Gln Ser
260 265 270

Phe Thr Cys Arg Cys Asp Thr Ile Val Ser Cys Glu Gly Tyr Val Val
275 280 285

Lys Lys Ile Thr Met Cys Pro Gly Leu Tyr Gly Lys Thr Val Gly Tyr
290 295 300

Ala Val Thr Tyr His Ala Glu Gly Phe Leu Val Cys Lys Thr Thr Asp
305 310 315 320

Thr Val Lys Gly Glu Arg Val Ser Phe Pro Val Cys Thr Tyr Val Pro
325 330 335

Ser Thr Ile Cys Asp Gln Met Thr Gly Ile Leu Ala Thr Asp Val Thr
340 345 350

Pro Glu Asp Ala Gln Lys Leu Leu Val Gly Leu Asn Gln Arg Ile Val
355 360 365

Val Asn Gly Arg Thr Gln Arg Asn Thr Asn Thr Met Lys Asn Tyr Leu
370 375 380

Leu Pro Ile Val Ala Val Ala Phe Ser Lys Trp Ala Arg Glu Tyr Lys
385 390 395 400

Ala Asp Leu Asp Asp Glu Lys Pro Leu Gly Val Arg Glu Arg Ser Leu
405 410 415

Thr Cys Cys Cys Leu Trp Ala Phe Lys Thr Arg Lys Met His Thr Met
420 425 430

Tyr Lys Lys Pro Asp Thr Gln Thr Ile Val Lys Val Pro Ser Glu Phe
435 440 445

Asn Ser Phe Val Ile Pro Ser Leu Trp Ser Thr Gly Leu Ala Ile Pro
450 455 460

Val Arg Ser Arg Ile Lys Met Leu Leu Ala Lys Lys Thr Lys Arg Glu
465 470 475 480

Umu Ile Pro Val Leu Asp Ala Ser Ser Ala Arg Asp Ala Glu Gln Glu
485 490 495

Glu Lys Glu Arg Leu Glu Ala Glu Leu Thr Arg Glu Ala Leu Pro Pro
500 505 510

Leu Val Pro Ile Ala Pro Ala Glu Thr Gly Val Val Asp Val Asp Val
515 520 525

Glu Glu Leu Glu Tyr His Ala Gly Ala Gly Val Val Glu Thr Pro Arg
530 535 540

Ser Ala Leu Lys Val Thr Ala Gln Pro Asn Asp Val Leu Leu Gly Asn
545 550 555 560

Tyr Val Val Leu Ser Pro Gln Thr Val Leu Lys Ser Ser Lys Leu Ala
565 570 575

Pro Val His Pro Leu Ala Glu Gln Val Lys Ile Ile Thr His Asn Gly
580 585 590

Arg Ala Gly Gly Tyr Gln Val Asp Gly Tyr Asp Gly Arg Val Leu Leu
595 600 605

Pro Cys Gly Ser Ala Ile Pro Val Pro Glu Phe Gln Ala Leu Ser Glu
610 615 620

Ser Ala Thr Met Val Tyr Asn Glu Arg Glu Phe Val Asn Arg Lys Leu
625 630 635 640

Tyr His Ile Ala Val His Gly Pro Ser Leu Asn Thr Asp Glu Glu Asn
645 650 655

Tyr Glu Lys Val Arg Ala Glu Arg Thr Asp Ala Glu Tyr Val Phe Asp
660 665 670

Val Asp Lys Lys Cys Cys Val Lys Arg Glu Glu Ala Ser Gly Leu Val
675 680 685

Iplu Val Gly Glu Leu Thr Asn Pro Pro Phe His Glu Phe Ala Tyr Glu
690 695 700

Leu Lys Ile Arg Pro Ser Ala Pro Tyr Lys Thr Thr Val Val Gly
705 710 715 720

Val Phe Gly Val Pro Gly Ser Gly Lys Ser Ala Ile Ile Lys Ser Leu
725 730 735

Val Thr Lys His Asp Leu Val Thr Ser Gly Lys Lys Glu Asn Cys Gln
740 745 750

Glu Ile Val Asn Asp Val Lys Lys His Arg Gly Lys Gly Thr Ser Arg
755 760 765

Glu Asn Ser Asp Ser Ile Leu Leu Asn Gly Cys Arg Arg Ala Val Asp
770 775 780

Ile Leu Tyr Val Asp Glu Ala Phe Ala Cys His Ser Gly Thr Leu Leu
785 790 795 800

Ala Leu Ile Ala Leu Val Lys Pro Arg Ser Lys Val Val Leu Cys Gly
805 810 815

Asp Pro Lys Gln Cys Gly Phe Phe Asn Met Met Gln Leu Lys Val Asn
820 825 830

Phe Asn His Asn Ile Cys Thr Glu Val Cys His Lys Ser Ile Ser Arg
835 840 845

Arg Cys Thr Arg Pro Val Thr Ala Ile Val Ser Thr Leu His Tyr Gly
850 855 860

Gly Lys Met Arg Thr Thr Asn Pro Cys Asn Lys Pro Ile Ile Ile Asp
865 870 875 880

Thr Thr Gly Gln Thr Lys Pro Lys Pro Gly Asp Ile Val Leu Thr Cys
885 890 895

Phe Arg Gly Trp Ala Lys Gln Leu Gln Leu Asp Tyr Arg Gly His Glu
900 905 910

Val Met Thr Ala Ala Ala Ser Gln Gly Leu Thr Arg Lys Gly Val Tyr
915 920 925

Ala Val Arg Gln Lys Val Asn Glu Asn Pro Leu Tyr Ala Pro Ala Ser
930 935 940

His Val Asn Val Leu Leu Thr Arg Thr Glu Asp Arg Leu Val Trp
945 950 955 960

Ljpk Thr Leu Ala Gly Asp Pro Trp Ile Lys Val Leu Ser Asn Ile Pro
965 970 975

Gill Gly Asn Phe Thr Ala Thr Leu Glu Glu Trp Gln Glu Glu His Asp
980 985 990

Lf| Ile Met Lys Val Ile Glu Gly Pro Ala Ala Pro Val Asp Ala Phe
995 1000 1005

Gfep. Asn Lys Ala Asn Val Cys Trp Ala Lys Ser Leu Val Pro Val Leu
1010 1015 1020

Asp Thr Ala Gly Ile Arg Leu Thr Ala Glu Glu Trp Ser Thr Ile Ile
1025 1030 1035 1040

Thr Ala Phe Lys Glu Asp Arg Ala Tyr Ser Pro Val Val Ala Leu Asn
1045 1050 1055

Glu Ile Cys Thr Lys Tyr Tyr Gly Val Asp Leu Asp Ser Gly Leu Phe
1060 1065 1070

Ser Ala Pro Lys Val Ser Leu Tyr Tyr Glu Asn Asn His Trp Asp Asn
1075 1080 1085

Arg Pro Gly Gly Arg Met Tyr Gly Phe Asn Ala Ala Thr Ala Ala Arg
1090 1095 1100

Leu Glu Ala Arg His Thr Phe Leu Lys Gly Gln Trp His Thr Gly Lys
1105 1110 1115 1120

Gln Ala Val Ile Ala Glu Arg Lys Ile Gln Pro Leu Ser Val Leu Asp
1125 1130 1135

Asn Val Ile Pro Ile Asn Arg Arg Leu Pro His Ala Leu Val Ala Glu
1140 1145 1150

Tyr Lys Thr Val Lys Gly Ser Arg Val Glu Trp Leu Val Asn Lys Val
1155 1160 1165

Arg Gly Tyr His Val Leu Leu Val Ser Glu Tyr Asn Leu Ala Leu Pro
1170 1175 1180

Arg Arg Arg Val Thr Trp Leu Ser Pro Leu Asn Val Thr Gly Ala Asp
1185 1190 1195 1200

Arg Cys Tyr Asp Leu Ser Leu Gly Leu Pro Ala Asp Ala Gly Arg Phe
1205 1210 1215

Asp Leu Val Phe Val Asn Ile His Thr Glu Phe Arg Ile His His Tyr
1220 1225 1230

Gln Gln Cys Val Asp His Ala Met Lys Leu Gln Met Leu Gly Gly Asp
1235 1240 1245

Ala Leu Arg Leu Leu Lys Pro Gly Gly Ile Leu Met Arg Ala Tyr Gly
1250 1255 1260

%r Ala Asp Lys Ile Ser Glu Ala Val Val Ser Ser Leu Ser Arg Lys
1265 1270 1275 1280

Hie Ser Ser Ala Arg Val Leu Arg Pro Asp Cys Val Thr Ser Asn Thr
1285 1290 1295

Glu Val Phe Leu Leu Phe Ser Asn Phe Asp Asn Gly Lys Arg Pro Ser
1300 1305 1310

Thr Leu His Gln Met Asn Thr Lys Leu Ser Ala Val Tyr Ala Gly Glu
1315 1320 1325

Ala Met His Thr Ala Gly Cys Ala Pro Ser Tyr Arg Val Lys Arg Ala
1330 1335 1340

Asp Ile Ala Thr Cys Thr Glu Ala Ala Val Val Asn Ala Ala Asn Ala
1345 1350 1355 1360

Arg Gly Thr Val Gly Asp Gly Val Cys Arg Ala Val Ala Lys Lys Trp
1365 1370 1375

Pro Ser Ala Phe Lys Gly Ala Ala Thr Pro Val Gly Thr Ile Lys Thr
1380 1385 1390

Val Met Cys Gly Ser Tyr Pro Val Ile His Ala Val Ala Pro Asn Phe
1395 1400 1405

Ser Ala Thr Thr Glu Ala Glu Gly Asp Arg Glu Leu Ala Ala Val Tyr
1410 1415 1420

Arg Ala Val Ala Ala Glu Val Asn Arg Leu Ser Leu Ser Ser Val Ala
1425 1430 1435 1440

Ile Pro Leu Leu Ser Thr Gly Val Phe Ser Gly Gly Arg Asp Arg Leu
1445 1450 1455

Gln Gln Ser Leu Asn His Leu Phe Thr Ala Met Asp Ala Thr Asp Ala
1460 1465 1470

Asp Val Thr Ile Tyr Cys Arg Asp Lys Ser Trp Glu Lys Lys Ile Gln
1475 1480 1485

Glu Ala Ile Asp Met Arg Thr Ala Val Glu Leu Leu Asn Asp Asp Val
1490 1495 1500

IfLu Leu Thr Thr Asp Leu Val Arg Val His Pro Asp Ser Ser Leu Val
1505 1510 1515 1520

Gly Arg Lys Gly Tyr Ser Thr Thr Asp Gly Ser Leu Tyr Ser Tyr Phe
1525 1530 1535

Glu Gly Thr Lys Phe Asn Gln Ala Ala Ile Asp Met Ala Glu Ile Leu
1540 1545 1550

Kir Leu Trp Pro Arg Leu Gln Glu Ala Asn Glu Gln Ile Cys Leu Tyr
1555 1560 1565

Ala Leu Gly Glu Thr Met Asp Asn Ile Arg Ser Lys Cys Pro Val Asn
1570 1575 1580

Asp Ser Asp Ser Ser Thr Pro Pro Arg Thr Val Pro Cys Leu Cys Arg
1585 1590 1595 1600

Tyr Ala Met Thr Ala Glu Arg Ile Ala Arg Leu Arg Ser His Gln Val
1605 1610 1615

Lys Ser Met Val Val Cys Ser Ser Phe Pro Leu Pro Lys Tyr His Val
1620 1625 1630

Asp Gly Val Gln Lys Val Lys Cys Glu Lys Val Leu Leu Phe Asp Pro
1635 1640 1645

Thr Val Pro Ser Val Val Ser Pro Arg Lys Tyr Ala Ala Ser Thr Thr
1650 1655 1660

Asp His Ser Asp Arg Ser Leu Arg Gly Phe Asp Leu Asp Trp Thr Thr
1665 1670 1675 1680

Asp Ser Ser Ser Thr Ala Ser Asp Thr Met Ser Leu Pro Ser Leu Gln
1685 1690 1695

Ser Cys Asp Ile Asp Ser Ile Tyr Glu Pro Met Ala Pro Ile Val Val
1700 1705 1710

Thr Ala Asp Val His Pro Glu Pro Ala Gly Ile Ala Asp Leu Ala Ala
1715 1720 1725

Asp Val His Pro Glu Pro Ala Asp His Val Asp Leu Glu Asn Pro Ile
1730 1735 1740

Pro Pro Pro Arg Pro Lys Arg Ala Ala Tyr Leu Ala Ser Arg Ala Ala
1745 1750 1755 1760

Glu Arg Pro Val Pro Ala Pro Arg Lys Pro Thr Pro Ala Pro Arg Thr
1765 1770 1775

Ala Phe Arg Asn Lys Leu Pro Leu Thr Phe Gly Asp Phe Asp Glu His
1780 1785 1790

Glu Val Asp Ala Leu Ala Ser Gly Ile Thr Phe Gly Asp Phe Asp Asp
1795 1800 1805

Val Leu Arg Leu Gly Arg Ala Gly Ala Tyr Ile Phe Ser Ser Asp Thr
1810 1815 1820

Ser Gly His Leu Gln Gln Lys Ser Val Arg Gln His Asn Leu Gln
1825 1830 1835 1840

Ala Gln Leu Asp Ala Val Gln Glu Glu Lys Met Tyr Pro Pro Lys
1845 1850 1855

Leu Asp Thr Glu Arg Glu Lys Leu Leu Leu Leu Lys Met Gln Met His
1860 1865 1870

Pro Ser Glu Ala Asn Lys Ser Arg Tyr Gln Ser Arg Lys Val Glu Asn
1875 1880 1885

Met Lys Ala Thr Val Val Asp Arg Leu Thr Ser Gly Ala Arg Leu Tyr
1890 1895 1900

Thr Gly Ala Asp Val Gly Arg Ile Pro Thr Tyr Ala Val Arg Tyr Pro
1905 1910 1915 1920

Arg Pro Val Tyr Ser Pro Thr Val Ile Glu Arg Phe Ser Ser Pro Asp
1925 1930 1935

Val Ala Ile Ala Ala Cys Asn Glu Tyr Leu Ser Arg Asn Tyr Pro Thr
1940 1945 1950



Val Ala Ser Tyr Gln Ile Thr Asp Glu Tyr Asp Ala Tyr Leu Asp Met
1955 1960 1965

Val Asp Gly Ser Asp Ser Cys Leu Asp Arg Ala Thr Phe Cys Pro Ala
1970 1975 1980

Lys Leu Arg Cys Tyr Pro Lys His His Ala Tyr His Gln Pro Thr Val
1985 1990 1995 2000

Arg Ser Ala Val Pro Ser Pro Phe Gln Asn Thr Leu Gln Asn Val Leu
2005 2010 2015

Ala Ala Ala Thr Lys Arg Asn Cys Asn Val Thr Gln Met Arg Glu Leu
2020 2025 2030

Pro Thr Met Asp Ser Ala Val Phe Asn Val Glu Cys Phe Lys Arg Tyr
2035 2040 2045

Ala Cys Ser Gly Glu Tyr Trp Glu Glu Tyr Ala Lys Gln Pro Ile Arg
2050 2055 2060

Ile Thr Thr Glu Asn Ile Thr Thr Tyr Val Thr Lys Leu Lys Gly Pro
2065 2070 2075 2080

Bfs Ala Ala Ala Leu Phe Ala Lys Thr His Asn Leu Val Pro Leu Gln
2085 2090 2095

(flu Val Pro Met Asp Arg Phe Thr Val Asp Met Lys Arg Asp Val Lys
2100 2105 2110

Val Thr Pro Gly Thr Lys His Thr Glu Glu Arg Pro Lys Val Gln Val
2115 2120 2125

Ile Gln Ala Ala Glu Pro Leu Ala Thr Ala Tyr Leu Cys Gly Ile His
2130 2135 2140

Arg Glu Leu Val Arg Arg Leu Asn Ala Val Leu Arg Pro Asn Val His
2145 2150 2155 2160

Thr Leu Phe Asp Met Ser Ala Glu Asp Phe Asp Ala Ile Ile Ala Ser
2165 2170 2175

His Phe His Pro Gly Asp Pro Val Leu Glu Thr Asp Ile Ala Ser Phe
2180 2185 2190

Asp Lys Ser Gln Asp Asp Ser Leu Ala Leu Thr Gly Leu Met Ile Leu
2195 2200 2205

Glu Asp Leu Gly Val Asp Gln Tyr Leu Leu Asp Leu Ile Glu Ala Ala
2210 2215 2220

Phe Gly Glu Ile Ser Ser Cys His Leu Pro Thr Gly Thr Arg Phe Lys
2225 2230 2235 2240

Phe Gly Ala Met Met Lys Ser Gly Met Phe Leu Thr Leu Phe Ile Asn
2245 2250 2255

Thr Val Leu Asn Ile Thr Ile Ala Ser Arg Val Leu Glu Gln Arg Leu
2260 2265 2270

Thr Asp Ser Ala Cys Ala Ala Phe Ile Gly Asp Asp Asn Ile Val His
2275 2280 2285

Gly Val Ile Ser Asp Lys Leu Met Ala Glu Arg Cys Ala Ser Trp Val
2290 2295 2300

Asn Met Glu Val Lys Ile Ile Asp Ala Val Met Gly Glu Lys Pro Pro
2305 2310 2315 2320

Tft Phe Cys Gly Gly Phe Ile Val Phe Asp Ser Val Thr Gln Thr Ala
2325 2330 2335

Cys Arg Val Ser Asp Pro Leu Lys Arg Leu Phe Lys Leu Gly Lys Pro
2340 2345 2350

L€pL Thr Ala Glu Asp Lys Gln Asp Glu Asp Arg Arg Arg Ala Leu Ser
2355 2360 2365

A*gg> Glu Val Ser Lys Trp Phe Arg Thr Gly Leu Gly Ala Glu Leu Glu
2370 2375 2380

Val Ala Leu Thr Ser Arg Tyr Glu Val Glu Gly Cys Lys Ser Ile Leu
2385 2390 2395 2400

Ile Ala Met Thr Thr Leu Ala Arg Asp Ile Lys Ala Phe Lys Lys Leu
2405 2410 2415

Arg Gly Pro Val Ile His Leu Tyr Gly Gly Pro Arg Leu Val Arg
2420 2425 2430



(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Asn Tyr Ile Pro Thr Gln Thr Phe Tyr Gly Arg Arg Trp Arg Pro
1 5 10 15

Arg Pro Ala Ala Arg Pro Trp Pro Leu Gln Ala Thr Pro Val Ala Pro
20 25 30

Val Val Pro Asp Phe Gln Ala Gln Gln Met Gln Gln Leu Ile Ser Ala
35 40 45

Val Asn Ala Leu Thr Met Arg Gln Asn Ala Ile Ala Pro Ala Arg Pro
50 55 60

Pro Lys Pro Lys Lys Lys Lys Thr Thr Lys Pro Lys Pro Lys Thr Gln
65 70 75 80

Pro Lys Lys Ile Asn Gly Lys Thr Gln Gln Gln Lys Lys Lys Asp Lys
85 90 95

Gln Ala Asp Lys Lys Lys Lys Lys Pro Gly Lys Arg Glu Arg Met Cys
100 105 110

Met Lys Ile Glu Asn Asp Cys Ile Phe Glu Val Lys His Glu Gly Lys
115 120 125

Val Thr Gly Tyr Ala Cys Leu Val Gly Asp Lys Val Met Lys Pro Ala
130 135 140

His Val Lys Gly Val Ile Asp Asn Ala Asp Leu Ala Lys Leu Ala Phe
145 150 155 160

Lys Lys Ser Ser Lys Tyr Asp Leu Glu Cys Ala Gln Ile Pro Val His
165 170 175

Met Arg Ser Asp Ala Ser Lys Tyr Thr His Glu Lys Pro Glu Gly His
180 185 190

Tyr Asn Trp His His Gly Ala Val Gln Tyr Ser Gly Gly Arg Phe Thr
195 200 205

Ile Pro Thr Gly Ala Gly Lys Pro Gly Asp Ser Gly Arg Pro Ile Phe
210 215 220

Asp Asn Lys Gly Arg Val Val Ala Ile Val Leu Gly Gly Ala Asn Glu
225 230 235 240

Gly Ser Arg Thr Ala Leu Ser Val Val Thr Trp Asn Lys Asp Met Val
245 250 255

Thr Arg Val Thr Pro Glu Gly Ser Glu Glu Trp Ser Ala Pro Leu Ile
260 265 270

Thr Ala Met Cys Val Leu Ala Asn Ala Thr Phe Pro Cys Phe Gln Pro
275 280 285

Pro Cys Val Pro Cys Cys Tyr Glu Asn Asn Ala Glu Ala Thr Leu Arg
290 295 300

Met Leu Glu Asp Asn Val Asp Arg Pro Gly Tyr Tyr Asp Leu Leu Gln
305 310 315 320

Ala Ala Leu Thr Cys Arg Asn Gly Thr Arg His Arg Arg Ser Val Ser
325 330 335

Gln His Phe Asn Val Tyr Lys Ala Thr Arg Pro Tyr Ile Ala Tyr Cys
340 345 350

Ala Asp Cys Gly Ala Gly His Ser Cys His Ser Pro Val Ala Ile Glu
355 360 365

Ala Val Arg Ser Glu Ala Thr Asp Gly Met Leu Lys Ile Gln Phe Ser
370 375 380

Ala Gln Ile Gly Ile Asp Lys Ser Asp Asn His Asp Tyr Thr Lys Ile
385 390 395 400

Arg Tyr Ala Asp Gly His Ala Ile Glu Asn Ala Val Arg Ser Ser Leu
405 410 415

Lys Val Ala Thr Ser Gly Asp Cys Phe Val His Gly Thr Met Gly His
420 425 430

Phe Ile Leu Ala Lys Cys Pro Pro Gly Glu Phe Leu Gln Val Ser Ile
435 440 445

Gln Asp Thr Arg Asn Ala Val Arg Ala Cys Arg Ile Gln Tyr His His
450 455 460

Asp Pro Gln Pro Val Gly Arg Glu Lys Phe Thr Ile Arg Pro His Tyr
465 470 475 480

Gly Lys Glu Ile Pro Cys Thr Thr Tyr Gln Gln Thr Thr Ala Lys Thr
485 490 495

Val Glu Glu Ile Asp Met His Met Pro Pro Asp Thr Pro Asp Arg Thr
500 505 510

Leu Leu Ser Gln Gln Ser Gly Asn Val Lys Ile Thr Val Gly Gly Lys
515 520 525

Lys Val Lys Tyr Asn Cys Thr Cys Gly Thr Gly Asn Val Gly Thr Thr
530 535 540

Asn Ser Asp Met Thr Ile Asn Thr Cys Leu Ile Glu Gln Cys His Val
545 550 555 560

Ser Val Thr Asp His Lys Lys Trp Gln Phe Asn Ser Pro Phe Val Pro
565 570 575

Arg Ala Asp Glu Pro Ala Arg Lys Gly Lys Val His Ile Pro Phe Pro
580 585 590

Leu Asp Asn Ile Thr Cys Arg Val Pro Met Ala Arg Glu Pro Thr Val
595 600 605

Ile His Gly Lys Arg Glu Val Thr Leu His Leu His Pro Asp His Pro
610 615 620

Thr Leu Phe Ser Tyr Arg Thr Leu Gly Glu Asp Pro Gln Tyr His Glu
625 630 635 640

Glu Trp Val Thr Ala Ala Val Glu Arg Thr Ile Pro Val Pro Val Asp
645 650 655

Gly Met Glu Tyr His Trp Gly Asn Asn Asp Pro Val Arg Leu Trp Ser
660 665 670

Gln Leu Thr Thr Glu Gly Lys Pro His Gly Trp Pro His Gln Ile Val
675 680 685

Gln Tyr Tyr Tyr Gly Leu Tyr Pro Ala Ala Thr Val Ser Ala Val Val
690 695 700

Gly Met Ser Leu Leu Ala Leu Ile Ser Ile Phe Ala Ser Cys Tyr Met
705 710 715 720

Leu Val Ala Ala Arg Ser Lys Cys Leu Thr Pro Tyr Ala Leu Thr Pro
725 730 735

Gly Ala Ala Val Pro Trp Thr Leu Gly Ile Leu Cys Cys Ala Pro Arg
740 745 750

Ala His Ala Ala Ser Val Ala Glu Thr Met Ala Tyr Leu Trp Asp Gln
755 760 765

Asn Gln Ala Leu Phe Trp Leu Glu Phe Ala Ala Pro Val Ala Cys Ile
770 775 780

Leu Ile Ile Thr Tyr Cys Leu Arg Asn Val Leu Cys Cys Cys Lys Ser
785 790 795 800

Leu Ser Phe Leu Val Leu Leu Ser Leu Gly Ala Thr Ala Arg Ala Tyr
805 810 815

Glu His Ser Thr Val Met Pro Asn Val Val Gly Phe Pro Tyr Lys Ala
820 825 830

His Ile Glu Arg Pro Gly Tyr Ser Pro Leu Thr Leu Gln Met Gln Val
835 840 845

Val Glu Thr Ser Leu Glu Pro Thr Leu Asn Leu Glu Tyr Ile Thr Cys
850 855 860

Glu Tyr Lys Thr Val Val Pro Ser Pro Tyr Val Lys Cys Cys Gly Ala
865 870 875 880

Ser Glu Cys Ser Thr Lys Glu Lys Pro Asp Tyr Gln Cys Lys Val Tyr
885 890 895

Thr Gly Val Tyr Pro Phe Met Trp Gly Gly Ala Tyr Cys Phe Cys Asp
900 905 910

Ser Glu Asn Thr Gln Leu Ser Glu Ala Tyr Val Asp Arg Ser Asp Val
915 920 925

Cys Arg His Asp His Ala Ser Ala Tyr Lys Ala His Thr Ala Ser Leu
930 935 940

Lys Ala Lys Val Arg Val Met Tyr Gly Asn Val Asn Gln Thr Val Asp
945 950 955 960

Val Tyr Val Asn Gly Asp His Ala Val Thr Ile Gly Gly Thr Gln Phe
965 970 975

Ile Phe Gly Pro Leu Ser Ser Ala Trp Thr Pro Phe Asp Asn Lys Ile
980 985 990

Val Val Tyr Lys Asp Glu Val Phe Asn Gln Asp Phe Pro Pro Tyr Gly
995 1000 1005

Ser Gly Gln Pro Gly Arg Phe Gly Asp Ile Gln Ser Arg Thr Val Glu
1010 1015 1020

Ser Asn Asp Leu Tyr Ala Asn Thr Ala Leu Lys Leu Ala Arg Pro Ser
1025 1030 1035 1040

Pro Gly Met Val His Val Pro Tyr Thr Gln Thr Pro Ser Gly Phe Lys
1045 1050 1055

Tyr Trp Leu Lys Glu Lys Gly Thr Ala Leu Asn Thr Lys Ala Pro Phe
1060 1065 1070

Gly Cys Gln Ile Lys Thr Asn Pro Val Arg Ala Met Asn Cys Ala Val
1075 1080 1085

Gly Asn Ile Pro Val Ser Met Asn Leu Pro Asp Ser Ala Phe Thr Arg
1090 1095 1100

Ile Val Glu Ala Pro Thr Ile Ile Asp Leu Thr Cys Thr Val Ala Thr
1105 1110 1115 1120

Cys Thr His Ser Ser Asp Phe Gly Gly Val Leu Thr Leu Thr Tyr Lys
1125 1130 1135

Thr Asn Lys Asn Gly Asp Cys Ser Val His Ser His Ser Asn Val Ala
1140 1145 1150

Thr Leu Gln Glu Ala Thr Ala Lys Val Lys Thr Ala Gly Lys Val Thr
1155 1160 1165

Leu His Phe Ser Thr Ala Ser Ala Ser Pro Ser Phe Val Val Ser Leu
1170 1175 1180

Cys Ser Ala Arg Ala Thr Cys Ser Ala Ser Cys Glu Pro Pro Lys Asp
1185 1190 1195 1200

His Ile Val Pro Tyr Ala Ala Ser His Ser Asn Val Val Phe Pro Asp
1205 1210 1215

Met Ser Gly Thr Ala Leu Ser Trp Val Gln Lys Ile Ser Gly Gly Leu
1220 1225 1230

Gly Ala Phe Ala Ile Gly Ala Ile Leu Val Leu Val Val Val Thr Cys
1235 1240 1245

Ile Gly Leu Arg Arg
1250

(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: -

(B) LOCATION: 1..115

(D) OTHER INFORMATION: /label= 26S_region

/note= "26S promoter and transcription start and
proximal downstream region of pSFV1; Figure 8."

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..24

(D) OTHER INFORMATION: /product= "26S promoter region"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

ACCTCTACGG CGGTCCTAGA TTGGTGCGTT AATACACAGA ATCTGATTGG ATCCCGGGTA 60 

ATTAATTGAA TTACATCCCT ACGCAAACGT TTTACGGCCG CCGGTGGCGC CCGCG 115

(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 127 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: RNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..127 

(D) OTHER INFORMATION: /label= 26S_region

/note= "26S promoter and transcription start and
proximal downstream region of pSFV2; Figure 8."

(ix) FEATURE: 

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..24

(D) OTHER INFORMATION: /product= "26S promoter region"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
ACCTCTACGG CGGTCCTAGA TTGGTGCGTT AATACACAGA ATTCTGATTA TAGCGCACTA 60
TTATATAGCA CCGGATCCCG GGTAATTAAT TGACGCAAAC GTTTTACGGC CGCCGGTGGC 120
GCCCGCG 127

(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(ix) FEATURE:

(A) NAME/KEY: -

(B) LOCATION: 1..123

(D) OTHER INFORMATION: /label= 26S_region

/note= "26S promoter and transcription start and
proximal downstream region of pSFV3; Figure 8."

(ix) FEATURE:

(A) NAME/KEY: misc_feature

(B) LOCATION: 1..24

(D) OTHER INFORMATION: /product= "26S promoter region"



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

ACCTCTACGG CGGTCCTAGA TTGGTGCGTT AATACACAGA ATTCTGATTA TAGCGCACTA 60
TTATATAGCA CCATGGATCC CGGGTAATTA ATTGACGTTT TACGGCCGCC GGTGGCGCCC 120
GCG 123

(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 
(iii) HYPOTHETICAL: NO 





(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Semliki Forest Virus

(ix) FEATURE:

(A) NAME/KEY: -

(B) LOCATION: 1..54

(D) OTHER INFORMATION: /label= restrict_site

/note= "sequence of SFV E2 genome in vicinity of BamHI site
in SFV vector E2; Figure 12."

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..54

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

AAC TCA CCT TTC GTC CCG AGA GCC GAC GAA CCG GCT AGA AAA GGC AAA 48
Asn Ser Pro Phe Val Pro Arg Ala Asp Glu Pro Ala Arg Lys Gly Lys
1 5 10 15

GTC CAT 54
Val His



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids
(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

Asn Ser Pro Phe Val Pro Arg Ala Asp Glu Pro Ala Arg Lys Gly Lys
1 5 10 15

Val His 



(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: HIV 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..46 

(D) OTHER INFORMATION: /label= fragment 

/note= "HIV gp120 epitope introduced into SFV
vector E2; Figure 12."

(ix) FEATURE:

(A) NAME/KEY: CDS 

(B) LOCATION: 1..45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

GAT CCG CGT ATC CAG AGA GGA CCA GGA AGA GCA TTT GTT GAG CTA
Asp Pro Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Glu Leu



(|?) INFORMATION FOR SEQ ID NO: 10: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Pro Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Glu Leu 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..51 

(D) OTHER INFORMATION: /label= chimaeric_seq 

/note= "SFV-HIV chimaeric sequence shown in Figure 
12." 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..51 

(D) OTHER INFORMATION: /product= "SFV-HIV chimaeric 
sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GAG GAT CCG CGT ATC CAG AGA GGA CCA GGA AGA GCA TTT GTT GAG GAT 
Glu Asp Pro Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Glu Asp 
1 5 10 15 

CCG 
Pro 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Glu Asp Pro Arg Ile Gln Arg Gly Pro Gly Arg Ala Phe Val Glu Asp 
1 5 10 15 

Pro 






(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..60 

(D) OTHER INFORMATION: /label= oligonucleotide 

/note= "used to introduce new linker site" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CGGCCAGTGA ATTCTGATTG GATCCCGGGT AATTAATTGA ATTACATCCC TACGCAAACG 60 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /label= oligonucleotide 

/note= "used to introduce new linker site" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCGCACTATT ATAGCACCGG CTCCCGGGTA ATTAATTGAC GCAAACGTTT TACGGCCGCC 60 
GG 62 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..62 

(D) OTHER INFORMATION: /label= oligonucleotide 

/note= "used to introduce new linker site" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGCACTATT ATAGCACCAT GGATCCGGGT AATTAATTGA CGTTTTACGG CCGCCGGTGG 60 
CG 62 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /label= primer 

/note= "SP1 upstream sequencing primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGCGGTCCT AGATTGGTGC G 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 




(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /label= primer 

/note= "SP2 downstream sequencing primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
<SPCGGGCGCC ACCGGCGGCC G 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /label= primer 

/note= "primer-1 for first strand cDNA synthesis" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTTCTCGTAG TTCTCCTCGT C 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..27 

(D) OTHER INFORMATION: /label= primer 

/note= "primer-2 for first strand cDNA synthesis" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GTTATCCCAG TGGTTGTTCT CGTAATA 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..28 

(D) OTHER INFORMATION: /label= primer 

/note= "5' most primer for second strand cDNA 
synthesis, equals bp 1-28 of SFV sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
ATGGCGGATG TGTGACATAC ACGACGCC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..46 

(D) OTHER INFORMATION: /label= adaptor 
/note= "5'-sticky end 
(EcoRI-HindIII-NotI-XmaIII-SpeI) blunt end-3' 
adaptor" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AATTCAAGCT TGCGGCCGCA CTAGTGTTCG AACGCCGGCG TGATCA 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..8 

(D) OTHER INFORMATION: /label= oligonucleotide 
/note= "NcoI oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCCATGGC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /label= oligonucleotide 

/note= "oligonucleotide used for screening by 
colony hybridization" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

GGTGACACTA TAGCCATGGC 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..24 

(D) OTHER INFORMATION: /label= oligonucleotide 

/note= "site-directed mutagenic oligonucleotide 
used to introduce a BamHI site into the SFV 
genome" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GATCGGCCTA GGAGCCGAGA GCCC 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Semliki Forest Virus 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..80 

(D) OTHER INFORMATION: /label= terminator 

/note= "3' terminal sequence of cDNA expression 
vector complementary to alphavirus genomic RNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

iTTCCAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60 
AAAAAAAAAA AAAAACTAGT 80 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: RNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Semliki Forest Virus 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..54 

(D) OTHER INFORMATION: /label= restrict_site 

/note= "sequence of SFV vector E2 in vicinity of BamHI site; 
Figure 12." 

(ix) FEATURE: 

(A) NAME/KEY: mutation 

(B) LOCATION: 27..32 

(D) OTHER INFORMATION: /label= restriction_site 

/note= "BamHI recognition sequence introduced into 
SFV E2 genome in SFV vector E2." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..54 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AAC TCA CCT TTC GTC CCG AGA GCC GAG GAT CCG GCT AGA AAA GGC AAA 48 
Asn Ser Pro Phe Val Pro Arg Ala Glu Asp Pro Ala Arg Lys Gly Lys 
1 5 10 15 

GTC CAT 54 
Val His 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Asn Ser Pro Phe Val Pro Arg Ala Glu Asp Pro Ala Arg Lys Gly Lys 
1 5 10 15 



Val His 



